Parsing With Clause and Intra-clausal Coordination Detection

نویسندگان

  • Domen Marincic
  • Tomaz Sef
  • Matjaz Gams
چکیده

We present a new dependency parsing algorithm based on the decomposition of large sentences into smaller units such as clauses and intraclausal coordinations. For the identification of these units, new methods combining machine learning techniques and heuristic rules were developed. The algorithm was evaluated on the Slovene dependency treebank text corpus. Compared to the MSTP parser, currently the most accurate for Slovene, parsing accuracy was improved by 1.27 percentage points, which equals 6.4% relative error reduction.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Parsing Aided by Intra-Clausal Coordination Detection

We present an algorithm for parsing with detection of intra-clausal coordinations. The algorithm is based on machine learning techniques and helps to decompose a large parsing problem into several smaller ones. Its performance was tested on Slovene Dependency Treebank. Used together with the maximum spanning tree parsing algorithm it improved parsing accuracy.

متن کامل

Clausal parsing helps data-driven dependency parsing: Experiments with Hindi

This paper investigates clausal data-driven dependency parsing. We first motivate a clause as the minimal parsing unit by correlating interand intra-clausal relations with relation type, depth, arc length and non-projectivity. This insight leads to a two-stage formulation of parsing where intra-clausal relations are identified in the 1 stage and inter-clausal relations are identified in the 2 s...

متن کامل

Parsing with Intraclausal Coordination and Clause Detection

Syntactic analysis, i.e., parsing of text is used during various tasks, e.g., machine translation, question answering, etc. The structure of a sentence is represented with a tree. Parsing long sentences is a difficult task. The motivation was to analyze sub-units of the sentence independently, which could improve the overall parsing accuracy. We developed a new parsing algorithm that includes i...

متن کامل

Intraclausal Coordination and Clause Detection as a Preprocessing Step to Dependency Parsing

The impact of clause and intraclausal coordination detection to dependency parsing of Slovene is examined. New methods based on machine learning and heuristic rules are proposed for clause and intraclausal coordination detection. They were included in a new dependency parsing algorithm, PACID. For evaluation, Slovene dependency treebank was used. At parsing, 6.4% and 9.2 % relative error reduct...

متن کامل

Machine Translation Through Clausal Syntax : A Statistical Approach for Chinese to English by Dan Lowe Wheeler

Language pairs such as Chinese and English with largely differing word order have proved to be one of the greatest challenges in statistical machine translation. One reason is that such techniques usually work with sentences as flat strings of words, rather than explicitly attempting to parse any sort of hierarchical structural representation. Because even simple syntactic differences between l...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Computing and Informatics

دوره 31  شماره 

صفحات  -

تاریخ انتشار 2012